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This memo originally bad t«0 parts- The 
first dealt with certain deficiencies in 
art early version of CuEmao’s program, SEE.- 
The problems have b*6n ftxedj and tht cor¬ 
responding discussion h&s been dropped from 
this memo. The part remaining deals with 
line drawings of objecta with ho1684 
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Fart II: tic-leg- in Obi sets 

Guzman's program docs not work very well on holes, The scene of 
figure 10 ie identified as two objects. Humana do a little better but 
da flat know how. If ant select* a typical human and asks hov be is sure 
something is a hole rather than a second object, he usually complains that 
it just cannot be any abject. 



Figure JCL A hale. 


I do not think we identify holes through elimination of other candi¬ 
dates. But then the question is What is it in a conglomeration of Hues 
that we Call a hole such that we do indeed Call it a hole + To answer 
this question„ I examine a numbcT of obscured holes to determine what is 
sufficient for the identification of holeyflcss. 

In figure H, ofle sees that the arrow alone does not help much. The 
arrow might represent a hcLe, but an upended brick or wedge are also 


possible. 



Figure 11, The arrow of a hole. 












The arrow together wifi an L are better. Examine figure 11. ■On 
the right one has 4 hole, while On the left one has a brick resting On a 
wedge. The only difference is the direction of the arrowed line* Figure 
13 shows intermediate cases. Most people agree that on the right one has 
an oddly shaped hole, while on the left* a lopsided brick, The emergent 
role seens to be: If the direction of the further line of the L is 
parallel to the center line of the fork, then no hola; if the direction 
is parallel to the other line of the fork, then a hole is quite likely; 
intermediate directions indicate situations between these bounds. 



Figure 12. A hole and a Brick, 


*The pyramid and wedge are also possible, but only if viewed from a degan-* 
arate angle; notice that the Lines observed to be parallel lose that feature 
under perturbation except in the case of a hole. 
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Figure 13. A peot bole and a poor brick. 



Figure 1A + Adding a fork* 





















4 


Adding the bottom of the- hold docs not add much ^ as one can seo in 
figure 14. In end case we have a hole; In another, an upended brick; and 
without the L, complete ambiguity just as before* Naturally in the CSSS 
o£ holes with bottoms it is useful to do a simple check to see if the hole 
goes all the way through. 

Turning to the solo I joint as seen in figure 15, one notes first of 
all that Gusman's prograE fails to bind* Huuans have a hard time too. 
Identification in these cases seems to be .global and vague, perhaps since 
SO little evidence is present, 

Irt any ease, the explanation of the through line of the T decides the 
case. If it makes sense as an edge of sOmO known object, there is probably 
no hole; otherwise, hole* Hew to derive the ntcOss&ry £3(pLanation is She 
next question, and 1 have no good answer CO that yet* Attaching an L to 
the T does not change the test. A scene similar to that of figure 15 can 
be drawn to support this conclusion. Figure 16 shows that a fork also makes 
no Change, Again the decision between hole and obscured object liea in 
the explanation of the through lino 0-f the T. 

Finally, with unobscured holes (figure 10), ®ne has all of these 
factors working together, Moreover, the connection of the T to the arrow 
via the L is local evidence that the through line of the T is not an edge 
of some obscuring body, 

****** 

1 snbE.it that correct identification of the parts of the scene in 
figure 1? would be a good final exam for a hole program, Guzman 1 $ 
program would suggest 1-2-3 and 4-5-* j 7-S-0-J.0 as objects using bottom of 
the barrel heuristics, 1 think the ideas of Fart 1 and Part II allied 
with a lot of sweat and cursing eould hsndl* the job- 





Figure 16. Adding a fork. 



























Figure 17- The final exart*. 
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Fart I; Holes in Guzman's Pr&gtam 
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Guzman's program amalgamates regions into objects adroitly and, 
for the most part, the resultant -partitionings are in harmony with 
human tastes. This is particularly Impresstve since the knowledge of 
the world built into the program is so slight. 

But while the program is a remarkable achievement indeed T it can 
be fooled by scenes with which humans have no trouble, Gusman himself 
reports some situations in which the program is too conservative and 

irk 

fails to unite all parts of certain objects. Beyond these, it is also 
possible to find situations in which the program is too liberal. Consider 
figure 1, Surprisingly, the three blocks are reported as a single object I 
The fault lies with the two fork joints indicated by the arrows. These 
joints create spurious links which are suffielent to bind everything 
together, 



Figure 1. A bad seen* f&r Guzman's program. 













The scene in figure 2a also causes a blunder* Hare the same linkage 
combination that hinds Tegions 1 and 2 of the wedge el go hinds region 2 tc 
region 3. Both objects axe reported as a single object. Curiously, 
turning the wedge around, as In figure 2b, eliminates the problem. 
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strong lints: ^ 
weak links; 


Figure 2, Another bad scene. 

There are at least three possible approaches to improvement: turning 
the program* providing the program with more information, and Integrating 
the program with an object recognizer. Tuning refers to adding to or 
slightly modifying the basic heuristics of the program. This seems fruit¬ 
less. Host ideas that help the program over these examples foul it up 
soBcwhsre else. 

Giving the program more information has promise in situations where 
a scene happens tn be recorded from a bad angle* By this 1 mean that 
the program would be cautioned against paeodo-joints of various kinds 
whose hoax tan be exposed through inflntesimal perturbation of the viewer's 
angle- hence the T joints in figure 3 would be identified as degenerate . 
since perturbation of the Viewer's angle changes them into an arrow and a 
fork. On the Other hand, Che Is of figure 4 are pe raistaiit since they 
remain Ts under any inf Intestinal perturbation of the viewer's angle* 












Figure Degenerate Ts 



Figure 4. Persistant Tb, 




















Iti caaCS whet* perturbation Of the degenerate T results In an arrow 
or fork, the appropriate faints «iiW, of course. be effected with the 
usual confidence. In other cues such Ts would yield misleading linkage 
*ere they not Identified as degenerate. Joints indicated by arrows In 
figure 5 Illustrate this point. Checking the validity of the five other 
joints that cause binding should be similarly helpful. 



Figure 5- More degenerate ts. 

The third way to improve performance la more radical, difficult, *nd 
potentially powerful. Rather than organize the system in the vertical 
oheln of figure & a one would marry region partitinner and object identifier 
Ifhen the object Identifier finds a proposed group of regions unseemly, 
control would pass to a routine that triesto split the group into more 
palatable subgroups which are again checked by the object identifier. In 
the case of figure 1, such a program would have a good hint. «otice that 
the linkage pairs that bind up 1-3-3, 4-5-6. Wid W a11 •“ *"° SS ’ 
single line as followss 





















Eyt the linkage pair that bind a 3-7 and Clauses the trouble involves 
two separated linej, Tills seecr.fi weaker. Noticing this, the regrouping, 
program would correctly propose 1-2-3, 4“5-6, and ?''8-9 as objects, which 
in turn would bo confirmed by the object identifier, 

<r 



Figure 6, Elementary vision system organisation. 

This particular heuristic is not useful in the case of figure 2a 
Since the same kind of joint both correctly binds together the wedge and 
incorrectly hinds the wedge to the block. The K Joint might save the day 
here (Gusman^s program, does not use the K joint at all), I think the K 
offers evidence that region 3 should be bound to either 2 or 4 but not 
both. Given this, the regrouping program would try two alternatives. One 
would be to add a link between 2 and 3 while breaking the link between 3 
and 4, This hrings no improvement. The other possibility would be to 
add a link between 3 and 4 and break those between 2 and 3. Victory, 

The region, identifier discovery a wedge and a brick. 










The regrcuper should also ha equipped to tty binding regions on the 
strength of a Cmnfnon edge, particularly when this involves a region that 
is otherwiEe unbound, This would allow the object of figure 7 to be 
identified rts a wedge rather than two single-region objects- 



Figure 7. A troublesome wedge- 


Eesaarhs on the nature of an object identifier must be even more 
ays£leal at this point. One or two not very well substantiated views 
dsay be worth &enttoning, however. 

First, I do not think extension of occluded lines is a very good way 
to identify obscured objects- For one thing> there are cases in. which the 
line extender would have to propose more than one extension possibility. 
Figure 13, to be seen in Fart IX, displays an example of such a scene. 
Secondly, reasonably extended lines often do not help much. Figure & 
looks like two blocks, but it is difficult to inagine how line extension 
could contribute to that conclusion. 



Figure £. Two blocks. 














Instead, I think recognition should he done by a description matching 
routine capable Of matching, observed features against models while excusing 
absent features that may logically be obscured. 

Finally, these Bodels might he arrayed in s preference list. Higher 
positions on the list would be reserved for several object types: simple 
objects 1 objects the program is told to look for; and objecte already found 
in the surround + An unknown would be compared to these in turn. Thus the 
obscured object in figure 9 could he identified as a wedge or cube depending 
on which the program ii looking for or which ptedooinates in the rest of 
the scene. 

If no match is found on the list, a regrouping of regions could he 
tried or the object could be described as a near match with sore flaw. 



Figure 9, Use of the preference list. 





